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1 Introduction 



Convolutional codes are a class of error-correcting codes that have enjoyed wide use in prac- 
tical applications due to the existence of efficient non-algebraic decoding algorithms. From 
a mathematical standpoint, however, the situation is still rather unsatisfying, as there are 
relatively few algebraic constructions of convolutional codes with provably good distance 
properties or which can be algebraically decoded. Recent years have seen interesting de- 
velopments in the algebraic theory of convolutional codes: the papers [3 |Hl CHI HH US] 
extend the notion of cyclicity familiar from block code theory to convolutional codes; the 
papers O ^] investigate weight enumerators and the existence of a MacWilliams Identity 
for convolutional codes; the paper ^H] uses methods from systems theory to construct con- 
volutional codes having a designed distance; and the papers jHl IHl CHI CHI 121] provide results 
concerning convolutional codes having certain maximal distance properties. Motivated by 
existence results appearing in certain members of this last set of papers, we decided to inves- 
tigate so-called superregular matrices. These matrices arise when one considers the problem 
of constructing convolutional codes having a maximum distance profile. 

The remainder of this paper is structured as follows. In Section 2, we give a brief 
introduction to convolutional codes, explain the maximal distance properties mentioned 
above, and define the superregularity property. In Section 3, we discuss how superregular 
matrices may be used to construct codes having a maximum distance profile. In Section 4, 
we introduce group actions that preserve the superregularity property; these actions made it 
possible to reduce the computation time necessary for performing searches for superregular 
matrices. Finally, in Section 5, we present an upper bound on the minimum field size required 
for a superregular matrix of a given size to exist. 

2 Preliminaries 

In this section, we review the theory of convolutional codes relevant to the presented work. 
Let k and n be positive integers with k < n. Let F be a finite field of characteristic p, where 
p is a prime number. 

Definition 2.1 A convolutional code C of rate k/n is a. rank-/c submodule of the free module 

Since F[s] is a principal ideal domain, C is a free module and has a well-defined rank. It 
follows that a convolutional code may be viewed as the column space of an n x /c polynomial 
matrix G{s), the columns of which form an F[s]-basis for C. As a set, we have 

C = {v{s) = G{s)u{s) e (F[s])" I u{s) e (F[s])'=}. 

G{s) is called a generator matrix ioi C, the u{s) are called information vectors, and the v{s) 
are called code vectors or codewords. Two generator matrices and G2{s) having full 
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column rank generate the same code if and only if there exists a k x k unimodular matrix 
U{s) such that Gi{s) = G2{s)U{s). 

The columns of G{s) may be thought of as polynomials with coefficients in F"; we refer 
to the degrees of these polynomials as the column degrees of G{s) and denote the degree of 
the ith. column by /ij. Let m := maxi<j<fc{/ij}. Thinking of (F[s])*^"^'^^ as F'^"^'^^[s], we may 
then expand G{s) into a matrix polynomial, 

= Go + Gis + ■ ■ ■ + 

where Gj is an n x A; matrix over F, the entries of which are the coefficients of in G{s). 
Similarly, thinking of (F[s])'^ as F*^[s], we may expand u{s) G (F[s])'^ of degree / into a vector 
polynomial: 

u{s) = uo + UlS H h uis''. 

We may then represent the encoding process with the multiplication 



Vl 
Vl+n 



Go 
Gi 

Gm-l 
Gm 











Go 
Gi 

Gm-l 

Gm. 









Go 
Gi 

Gm-l 
Grr,. 



Uo 
Ml 



where represents the n x k matrix with all entries zero. The large matrix in the middle 
is called a sliding generator matrix. From this representation, we see the origin of the name 
convolutional code: the vector coefficients of the information vector are convoluted with the 
matrix coefficients of G(s) to form the codewords. 

A generator matrix G(s) is called basic if the only common divisors of the determinants of 
its X A; submatrices belong to F\{0}. There are several characterizations of this property; 
more details may be found in jTTl |^ Ej • We note here two of these characterizations. The 
first is that a convolutional code generated by a basic generator matrix is a direct summand 
of F"[s]. The second is that there exists a basic n x [n — k) matrix over F[s], H{s), such 
that C is the right F[s]-kernel of H^{s). Thus, an observable code may also be described as 
the set 

C = {v{s) e (F[s])" I H'^isHs) = [0 ■ ■ ■ 0]^ G i¥[s]f} 

H{s) is called a parity check matrix of C. As with the generator matrix, one may expand 
H(s) as the sum 

H{s)=Ho + H,s + --- + Hm's"'\ 
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where m' is the largest integer such that Hm' 7^ 0. A convolutional code C is called observable 
if one (and hence every) generator matrix of C is basic. 

The high-order coefficient matrix Goo of G{s) is a matrix, the ith column of which is the 
vector coefficient of s^' in column i. A basic generator matrix for which Goo has full rank is 
called minimal. An important invariant of a convolutional code is its complexity, defined as 
follows: 

Definition 2.2 The complexity 5 of a convolutional code C is the maximum of the degrees 
of the (polynomial) determinants of the k x k submatrices of any generator matrix of C. 

This definition makes sense, as the equivalence relation described above preserves the degrees 
of these determinants. A code of rate k/n and complexity 6 will also be referred to as an 
(n, k, 5)-code. If the code is observable, the complexity is normally referred to as the degree 
of the code; see for the geometric motivation for this terminology. 

In general. Goo need not have full rank; in this case, Yli=i f'i > ^- shown in jlj that it 
is always possible to find a unimodular matrix U{s) such that G{s)U{s) has a full-rank high- 
order coefficient matrix and a set {i/jj^^i of column degrees satisfying z/i < z/2 < ■ ■ ■ < i^fc and 
^i^i = 5. In this case, the column degrees are invariants of the code generated by G(s) 
and are called the column indices or the Kronecker indices of the code. If G(s) is minimal, 
the column degrees z^i < z>'2 < ■ ■ • < '^fc mentioned above are called the minimal column 
indices or the Forney indices of the code. For the remainder of the paper, we assume all 
codes to be observable and generator matrices to be minimal. 

The following truncated sliding generator matrices G^ G F*^-'"''^^"^*^-''^^)'^ and parity check 
matrices if| G F'^-'"^^)"'^^-'"*"^^'^""'^^ will be of importance in this work. These are defined for 
each j G No as 

Go 
P> 

where Gj = {Hj = 0) if j > m (j > m'). The relation H^{s)G{s) = immediately implies 
that {H^jfG'j = for all 3 G Nq. 

We turn now to some distance notions for convolutional codes. 

Definition 2.3 Let x G F". The Hamming weight of x, wt(x), is the number of nonzero 
components of x. Let v{s) G F"[s] be given by v{s) := vo+vis+- ■ ■+vis^ for some nonnegative 
integer /. The weight of v{s), wt(f (s)), is the sum of the weights of its F^-coefficients: 

I 

wt(t;(s)) := wt(fi). 

i=0 

We then have 





Gq 

Gj^i 
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Go 
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(2.1) 



4 



Definition 2.4 Let C be an (n, k, S)-code. Then, the free distance of C, dfree{C), is 




nin {wt(t'(s)) | v{s) ^ 0}. 
s)ec 



The following theorem gives an upper bound for how large the free distance of an (n, A;, S)- 
code can be: 

Theorem 2.5 Let C be an {n,k,5)-code. Then, dfree{C) satisGes 



This bound is proven in ^Hl- It is known as the generalized Singleton bound. The reason for 
this is that a convolutional code of complexity is a block code, as it has a generator matrix 
with all entries in F. If we set 5 = in the above expression, it reduces to the Singleton 
bound from the theory of block codes. In analogy with the block code case, convolutional 
codes having a free distance meeting the generalized Singleton bound are called maximum 
distance separable (MDS). 

The free distance is a global distance measure and determines the maximum number of 
errors that may be introduced to a codeword without jeopardizing correct decoding. A more 
local distance measure, relevant to the performance of sequential decoding algorithms (see, 
for example, [IE!), is given by column distances. These are defined as follows: 

Definition 2.6 Let C be a convolutional code. For j G Nq and a polynomial vector v{s) G 
F"[s] of degree /, set V[oj]{s) := vq + vis + ■ ■ ■ + VjS^ (where Vj = if j > /). Then, the jth 
column distance of C, d'^JC), is defined as 



Because of the assumption that C is observable, the fact that f o 7^ means that the minimum 
is taken over codewords resulting from information vectors with uq 7^ 0. It is easy to see 
that the column distances satisfy 



The (m + l)-tuple of numbers ((io(C), dl{C), . . . , d'^{C)) is called the column distance profile 
of the code [H P- 112]. 

The following theorem gives an upper bound for the jth column distance: 

Theorem 2.7 Let C be an (n, k, 6)-code. For every j G Nq, we have 

d^^(C)<in-k)ij + l) + l. 

If d'^AC) = {n — k){j + 1) + 1 for some j, then d'i{C) = {n — k){i + 1) + 1 when i < j. 




d'jiC) : 



min {wt(w[oj](s)) | Vq ^ 0}. 



d^oiC) < dl{C) <dl{C)...< lim d^'AC) = dfreeiC). 



(2.2) 
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A proof may be found in . By considering the bound in Theorem \2.5\ one easily sees 
that the largest integer for which the upper bound in Theorem 12.71 can be attained is L := 
LfJ + L;r^J- {n, k, 5)-code C is said to be maximum distance profile (MDP) if d^C) = 
(L + l){n — k) + 1; note that Theorem 12.71 implies that d'^{C) = (j + l){n — k) + 1 for all 
j G {0, 1, . . . , L}. In other words, the column distances of an MDP code are maximal for as 
long as possible. 

We close this section by introducing superregular matrices; in Section 2, we will see their 
relevance to the construction of MDP codes. 

Definition 2.8 Let A be the (7 + 1) x (7 + 1) lower triangular Toeplitz matrix 

ao ■■■ 0' 

CLl CLq ' ■ '. 

': ■•. ■•• 

Qi^ • • • di CLq 

Let s G {1, 2, . . . , 7 + 1}. Suppose that / := {ii,...,is} is a set of row indices of A, 
J := {ji, . . . ,js} is a set of column indices of A, and that that the elements of each set 
are ordered from least to greatest. We denote by the submatrix of A formed by 

intersecting the columns indexed by the members of J and the rows indexed by the members 
of /. A submatrix of A is said to be proper if, for each u G {1, 2, ... , s}, the inequality 
ju < holds. The matrix A is said to be superregular if every proper submatrix of A has a 
nonzero determinant. 



Remark 2.9 Observe that the proper submatrices of A are the only ones that can possibly 
have a nonzero determinant: If > i^ for some z/, then, in the submatrix A^-^''"'*'^, the upper 
right block consisting of the first u rows and the last s — u + 1 columns contains only zero 
entries. Hence, the matrix formed from the first u rows of can have rank at most 

u — 1. For example, if 7 > 4, we can consider the submatrix 



^1.2,5 
"^1,3,4 



ao 
ai 

Cii 






02 






ai_ 



This submatrix clearly has a zero determinant regardless of what ao, cii, 02, and 04 are. 
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3 A Construction of MDP Codes from Superregular 
Matrices 



In this section, we take a closer look at the transposes (Hj)'^ G F*^-'"*"-^-"^" k)x{j+i)n ^^iq 
matrices that were introduced in ()2.1|) : 



We then have the following definition: 



HI 



Hj Hj_^ 



HI 



Definition 3.1 (Hj) is said to have the maximum span property if none of the first n 
columns of {H'^Y^ is contained in the span of any other (j + l){n — k) — 1 columns of (Hj)'^. 

We note here that, of course, each of the first n columns of {Hj)^ is in the span of some 
set of (j + l){n — k) columns of [Hj)'^. We also make the observation that, if C is an 
(n, k, 5)-code and (Hj)'^ the transpose of its jth truncated sliding parity check matrix, then 



[n- 



k){j + 1) + 1 if and only if (Hj) has the maximum span property. In particular, 
C is an MDP code if and only if {HD^ has the maximum span property. 

Since H{s) is basic, we may assume without loss of generality (for the purpose of what 
follows) that the (n — fc) x (n — k) submatrix of Hq consisting of the first n — k columns 
has full rank. In this case, left multiplication by an invertible matrix followed by a column 
permutation gives the matrix 



1 
1 

••• 



... 

■•. 

1 



^0 
Hi Hq 



Hj Hj_i 






Ho 



n—k) 



H 



Recalling Definition 3.1, we will also say that any matrix having the same form as (Hj)'^ has 
the maximum span property if none of the first k columns of H is contained in the span of 
any other [n — k){j + 1) — 1 columns of {Hj)'^. Note that if (Hj)'^ has the maximum span 

property, then so does (Hj)'^. The next theorem is a simplified version of jHl Theorem 3.5]; 
it relates the superregularity property to the maximum span property: 

Theorem 3.2 Let T be a {j + 1) x (j + 1) lower triangular Toeplitz matrix: 

'to ■■■ O" 



T :-- 



ti to 



tj tj^i 





to 



(3.3) 
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Then, T is superregular if and only if the matrix [Ij+i \ T] has the maximum span property. 

Given k, n, and j, we will now see how to use an appropriately sized superregular matrix to 
construct a matrix which has the form of (Hj)'^ and the maximum span property (for more 
details, see [HI Appendix C]): 

Theorem 3.3 Let T be an I x I superregular matrix, where I > {j + l){n — 1). Let T' the 
submatrix obtained from T by intersecting the rows indexed by {k, A; + l,...,n — 1,72 — 1 + 
k,n - 1 + k + 1, . . . ,2{n - 1), . . . ,j{n - 1) + k, j{n - 1) + k + 1, . . . ,j{n - 1) + n - 1} and 
columns indexed by {1,2, ... ,k,n — 1 + l,n — 1 + 2, ... ,n — 1 + k, ... ,j{n — 1) + l,j{n — 
1) + 2, . . .,j{n - 1) + k}. Then, V G wi3+^)(n-k)x(j+i)k ^ j^^^^ j^j^^j^ triangular Toeplitz 
matrix such that | T'] has the maximum span property. 

If we let j = L, we may use the resulting matrix to construct a parity check matrix of an 
MDP convolutional code. 

Example 3.4 In this example, we consider the following matrix over F64: 



1 

























1 






















UJ 


1 























1 




















UJ 


1 

















UJ^ 


UJ 


1 








OJ 




CU33 


u;33 


UJ^ 


UJ 


1 





1 


UJ 


00^ 


UJ^' 


CU33 


UJ^ 


UJ 


1 



Here, is a root of + x + 1 G F2[x] and is thus a primitive field element. According 
to [H], this matrix is superregular. We let = 2, n = 3, and 6 = 2. Then, L = 3, and, since 
{L + l)(n — 1) = 8, we may use Theorem 13.31 to form the matrix 



1 











UJ 


1 























1 








^33 


UJ^ 


UJ 


1 




















1 





UJ^ 


UJ-'' 


^33 


UJ^ 


UJ 


1 

















1 


1 


UJ 




Uj'' 


Uj'' 


Uj' 


UJ 


1 



which has the maximum span property. We may then use this matrix to construct a polyno- 
mial generator matrix for an MDP (3, 2, 3) code. We could, for example, think of the 1x2 
matrices making up the first block column of the right-hand side of this matrix as a finite 
sequence of Markov parameters and compute a minimal partial realization of this sequence 
(see, for example, ^J). We would then compute the corresponding transfer function. After 
some small algebraic manipulations and fixing an ordering for the codeword components, 
we would arrive at a polynomial generator matrix and its corresponding convolutional code 
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(see jin] for more information about the relationship between the hnear systems and polyno- 
mial representations of convolutional codes). In this example, one possibility for a resulting 
generator matrix is 







^2 + ^57^ + ^62 



4 Group Actions Preserving the Superregularity 
Property 

In this section, we consider group actions that allow one to create new superregular matrices 
from a given one. The main utility of these actions is that they reduce the number of matrices 
one must check when performing a computer search for matrices having the superregularity 
property. 

For a prime power and a nonnegative integer 7, denote by SR{p^, 7) the set of super- 
regular matrices of dimension 7 + 1 over F^e. The first group action is a corollary of the 
following result, which is proven in 0: 

Theorem 4.1 Suppose that A G SR{p%^). Then, A^^ G SR{p%^). 

Corollary 4.2 There is an action *i of the additive group (Z, +) on the set SR{p^, 7), given 
hy 

*i : ZxSR{p%-f) — > SR{p',-f) 
{x,A) ^ 

Proof: By Theorem \4.1\ A^^ is superregular, and it is clear that *i is a group action. □ 
We next have the following simple result: 

Lemma 4.3 If A e SR{p'','j) and 7 > 2, then A^ A'^. 
Proof: Suppose 



A 



ao 



... 
■•. 



is a superregular matrix, where 7 > 2. Suppose in addition that A = A~^. Then, since 
A"^ = /-Y+i, we must have that a^ai +aiaQ = 2aoOi = 0. By hypothesis, oq, cii 7^ 0. Thus, the 
field must have characteristic 2. We must also have that 0200 + + 00*^2 = 2aoa2 + of = 0. 
Since the field has characteristic 2, this equation reduces to = 0. This is a contradiction, 
as ai ^ 0. Thus, A ^ A'K □ 



Corollary 4.4 If 7 > 2, then ISR^p^,^)] is even. 
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We next describe an action of the multiplicative group F*e := Fpe\{0} of nonzero field 
elements of F^e on SR{p^,^): 



Theorem 4.5 Let a G F*e. Define 



a* A 



aai 
o?a2 



••. ••. 

Op'a2 OLOi\ Qq 



Then, the map 



*2: ¥;.xSR{p^,^) - 
{a, A) ^ 

is an action ofF*e on SR{p'\j). 

Proof: It is readily verified that *2 is a group action. Let 

"l ••• 



5i^(p^7) 

a»A 



D :-- 



a 







■-. 

q;T 



One can then describe *2 through conjugation hj D: a • A = DAD^^. This changes the 
determinants of submatrices of A only by factors of powers of a, and it follows that a» A is 
also superregular. □ 
The final group action we consider makes use of the Galois group Autvp¥pe. We recall 
here the fact that Autp^Fpe — (a), where a e Aut^^Fpe is the Fp-automorphism defined by 
a{x) — E ¥pe. Consequently, Autw^^Wpe = Z/eZ. 

Theorem 4.6 Let A e SR{p^,'y). Let i G Z/eZ. Dehne 

a^' 



ioA:^ 



p 



p" p" p" 

Ci'2 Ojq 



Then, the map 

*3: Z/eZ X SR{p%-f) — > SR{p%^) 
{i,A) I — )• ioA 

is an action of the additive group Z/eZ on SR{p^, 7). 
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Proof: It is readily verified that *3 is a group action. As the entries of A belong to a 
field of characteristic p, the determinant of a submatrix of i o A is the determinant of the 
corresponding submatrix of A raised to the p'th power. Thus, i o Ais also superregular. □ 

5 An Upper Bound for the Required Field Size 

An important question to consider in trying to better understand superregular matrices is 
that of how large a finite field must be in order that a superregular matrix of a given size 
can exist over that field. For example, no 3 x 3 superregular matrix exists over the field 
F2: all entries in the lower triangular part of a superregular matrix must be nonzero, which 
means that in this case all such entries would have to be 1; clearly, this does not result in 
a superregular matrix, since the lower left 2x2 submatrix has a zero determinant. In this 
section, we give an upper bound on the required field size. We begin with a technical lemma: 

Lemma 5.1 Let i be a nonnegative integer and 7 a positive integer. Let Si+i denote the 
set of integer sequences {si}]^l, where Sq = 0, Sj+i = 7, and si < s;+iV/ e {0,1, 
Let Si^j :— {{siYi'tl G S'j+i with sj + Si-j+i < 7, j = 0, 1, ... , [|]}. Let Tj+i denote the 
set of integer sequences {tiYiZl, where to — 0, ti+i — 7, and ti < VZ e {0, 1, . . . Let 
Ti,j := {{ti}itl e Ti+, with E"o(-l)'(^m - i/) > 0, m = 0, 1, . . . Then, S^,^ and T,,^ 
are finite sets, and \Si^^\—\Ti^j\. 

Proof: It is clear that Si^j and Ti^j are finite sets. We now proceed to prove the second 
part of the claim. We will do this by first constructing an injective map from Si^^ to Tj_-y 
and then an injective map from Tj,y to Si^^. Throughout the proof, t-i and s_i are defined 
to be 0. Suppose first that i is even. Let {siYi^l ^ ^hj- J £ {0, 1, ■ ■ ■ , |}, we make the 
recursive definitions 

t2j+l '■= t2j + Sj_j+i — Si-j, 
t2j '■= t2j-l + Sj — Sj-i. 

Note that to — sq — 0. Then, for m e {0, 1, . . . , i}, we have 

m 

j2{-iyiti+i-ti)^ti- 

1=0 

= 2ti 
= 2ti 



- {t2 - ti) + {t3-t2) + {-intm+l - tm) 

- 2t2 + 2t3 + (-l)"Wi 

— 2^2 + 2^3 — • • • — t2j 

or 

— 2^2 + 2^3 — • • • + ^21+1, 
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where j e {0, 1, . . . , |} is (m — l)/2 or m/2 as m is odd or even, respectively. Suppose 
that m is odd. The sum then becomes 

2ti -2^2 + 2^3 t2j = 2ti - 2{ti + Si) + 2^3 {t2j-l + - Sj_i) 

= hj-i - Sj - Sj_i. 
Using the definition of ^2^+1 with j replaced by j — 1, we may write 

^2j-l ~ Sj — Sj-i = t2j-2 + ■Si-j+2 ~ ~ Sj — Sj^i- 

Using again the definitions above, we see that, for any integer /i e {0, . . . , j — 1}, 

t2j-2 + Si-j+2 - Si-j+i - Sj - Sj-i = t2{j-h}-2 + " Si-j+l " Sj - Sj-(/i+l) 

this equation holds trivially ii h — 0, and, if /i e {0, . . . , j — 2}, we have 

^2(j-/i)-2 + Si^[j-(h+l))+l — Sj_j_|_i — Sj — Sj-(/i+l) = 

hij-h-i)-i + Sj-h-i + + Si_(j_(/i+i))+i — — — Sj_(fe+i) = 

^2{j-h-2) + Si_(j_/i_2)+l — Si_(j_/i_2) + — 'S(j_/i_i)_i + Si_(j_(/j+i))+i 

— Si-j+i — Sj — Sj_(/j+i) — 

h{j-{h+l))-2 + Si_(j_((^+i)+i))+i - Si_j+i - Sj - Sj_((fe+i)+i). 

In particular, letting h — j — 2 and recalling that Sj+i = 7 and sq = 0, we see that 

t2j-2 + Si-j+2 — — — Sj-i = 7 — Si_j+i — Sj. 

Because {si}]tl ^ ^hi^ 7 - Si-j+i - Sj > 0. 

Suppose now that m is even. We are then interested in the sum 

2ti — 2^2 + 2^3 — . . . + t2j+l- 

Prom the analysis in the case of m odd, we see that we may write 

2ti — 2^2 + 2^3 — . . . + t2j+l = t2j-l ~ Sj — Sj_i — t2j + t2j+l 

= hj-l — Sj — Sj_i — t2j + t2j + Sj-j+l — Si-j 
^2j—\ Sj Sj — \ + Sj_j_|_i Si—j. 

Using the same method as before, we see that, for any integer /i e {0, . . . , j — 1}, 

t2j-l — Sj — Sj-i + Sj-j+i — Si-j = t2{j-h)-l + Si-(^j-h)+l — Si-j — Sj — Sj_(/i+i). 

In particular, letting h — j — 1, the right side of this equality reduces to 

t\ ~\~ Si Si-j Sj. 
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Using the definition of ti, this becomes 

^j+l ~l~ ^i—j ^i—j 

Since Si-j < Sj-j+i and 7 — — sj > 0, it follows that 7 — Si^j — sj > 0. 

It remains to be shown that tj+i = 7. We have 

hj+i — hj + Si-j+i — Si^j 

— ^2j-l + Sj — Sj-i + Sj_j4-i — Si-j 

— ^2j-2 + Si-j+2 + Sj — Sj-i — Si-j. 

In the same way as above, one sees that, for any integer /i e {1, . . . , j}, 

^2j-2 + Si-j+2 + Sj — Sj-i — Si-j — t2{j-h) + — Sj-h + Sj — Si-j. 

In particular, letting J = | and h = j, the right side of the preceding equality simplifies to 
Sj+i. By hypothesis, Sj+i = 7. Thus, tj+i = 7. This completes the case of i even. 

Suppose now that i is odd. For j e {0, 1, ... , define f2j+i as above. For j e 
{0, 1, . . . , ^}, define as above. The proof then proceeds as in the case of i even. 

We thus obtain a well-defined map / from Si^^ to 7],^: if {s;}^^q G S'j^^, let /({sJJ^q) = 
{tiYi^Q, where each ti is defined via the equations at the beginning of the proof. It follows 
immediately from the definition of the ti that / is injective. Thus, to show |5'j,-y| = |Tj^-y|, it 
will suffice to construct a similar injective map from Tj ,y to Si^^. 

Suppose first that i is even. Let {t/};=o ^ ^2,7- -^^^ J ^ {0; 1) • • • ) |}) 'we make the 
recursive definitions 

Si-j :— Si-j+i — t2j+l + t2j. 

Note that sq = to = 0. Then, using these definitions, we have that 

Sj + Si-j^i — Sj-i + t2j — t2j-l + Si-j^2 + ^2j-2 ~ ^2j-l 

— Sj-2 + ^2j-2 ~ ^2j-3 + t2j — t2j-l + Si-j+s + ^2^-4 ~ ^2j-3 + ^2j-2 ~ ^2j-l 



= 7-2ti + 2t2 ht2i. 
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Because {tiYi^o ^ 2ii — 2^2 + ■ ■ ■ — t2j > 0. Thus, sj + Sj-j+i < 7. It remains to be 
shown that Sj+i = 7. Prom the definition of Sq, it is clear that Sq — 0. We have that 

^i+l = Si + ti 

= Si-i +t3-t2 + ti 

= Si-2 + — + — t2 + ti 



We also have that 



— Si + ti^i — ti + — tj_2 + • • • + ^3 — ^2 + ^1- 



Si — Stil + ~ ti-i 
2 2 

= Si^ + ti-2 — ti-3 + ti — ti-i 
2 



— — ^1 + ^2 ^ ^3 + ■ ■ ■ + ~ ti-i + ti — 

Thus, Sj+i = tj+i. Since tj+i = 7 by hypothesis, it follows that Sj+i = 7. 

Suppose now that i is odd. For j G {0, 1, ... , ^}, define as above. For j e 
{0, 1, . . . , define Si-j+i as above. The proof then proceeds as in the case of i even. 

We thus obtain a well-defined map g from Ti^^ to Si^^: if {ti}]to ^ ^1,7; ^^t ^'({tjjlo) = 
{sz};io' where each si is defined via the equations above. It follows immediately from the 
definition of the si that g is injective. Thus, \Si^^\ — iTi^-y]. □ 

We need one more technical lemma before computing the upper bound: 

Lemma 5.2 Let 7 be a positive integer. Then, Ui<i<j<'y = 

Proof: The proof is by induction. The claim is clearly true if 7 = 1. Suppose that 
ni<.<,<, - ^2 f for 7 > 1- Then, 

-|-r 2 + i+j ^ 2 + i + j -|-|- i + 7 + 3 ^ 

l<i<j<7+l l<i<j<7 l<i<7+l ' 

1 (27 + 2)(27 + l)---l (7 + 4)(7 + 5)---(27 + 4) ^ 

7 + 2(7 + l)(7)---l(7 + l)(7)---l(7 + 2)(7 + 3)---(27 + 2) " 

(27 + 2) (27 + 1) ■ ■ ■ (7 + 3) (7 + 4) (7 + 5) ■ ■ ■ (27 + 4) _ 
(7 + l)(7)---l (7 + 2)(7 + 3)---(27 + 2) 

1 (27 + 4)(27 + 3)---(7 + 3) 1 (27 + 4)(27 + 3) ■ ■ ■ 1 



7 + 3 (7 + 2)(7 + 1) • • • 1 7 + 3 (7 + 2)(7 + 1) • • • 1(7 + 2)(7 + 1) 

1 /27 + 4^ 1 /2((7 + 1) + 1)^ 

7 + 3V7 + 2y (7 + l) + 2V (7 + l) + l 

This proves the claim. □ 
We are now ready to prove the main result of this section: 
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Theorem 5.3 Let 7 e N. Let denote the ^th Catalan number: Cj :— ;;^(^^). Let F 
be a finite Geld such that |F| > |(C^_i + (j^^j))- Then, there exists 37x7 superregular 
matrix over F. 



Proof: We first upper bound the number of submatrices of a 7 x 7 lower triangular Toeplitz 
matrix that possess a certain property and show that this upper bound is given by the 
expression in the statement of the theorem. We then observe that the number thus obtained 
is actually an upper bound on the minimal size a finite field F must have in order that a 
7x7 superregular matrix over F can exist. 

For convenience, wc drop the matrix entries with index 0. Thus, a 7 x 7 lower triangular 
Toeplitz matrix X with indeterminate entries is now defined by a first column of the form 



[xi X2 ■ ■ ■ X 



7J ■ 

The determinants of the proper square submatrices of such a matrix are given by nonzero 
polynomials in these indeterminates. Notice that can appear to at most the first power 
in any of these polynomials; in other words, each of these polynomials has the property that 
either it is linear in x^, or x^ docs not appear in any of its terms. We are interested in those 
proper square submatrices whose determinants are linear in Xj, and we denote the set of 
such submatrices by Lj. 

First, we observe that L^ consists of the single entry x^ — X^ and the submatrices 
^ii i2,...^ 1,7^ where s e {2, . . . , 7 - 1}, where < for all 1/ e {1, 2, . . . , s - 1}. That X] 
is the only 1x1 submatrix in L-y is obvious; to see the second part, evaluate '^Q^X^^'^^'_\\''-^~^'^ 
by doing a cof actor expansion along the first column of j5^*ij*2v -,*s-i,7 rpj^g indeterminate 
x-y appears if and only if detXjJ'*^'''''*'^^^ ^ 0. This is the case if and only if -^j^j^i .-jCi 
is a proper submatrix, and X*,^'*^' is a proper submatrix if and only if j^, < for all 
z/G {l,2,...,s- 1}. 

Next, wc observe that is closed with respect to transpose about the antidiagonal of 
X. In order to see this, note that the transpose submatrix -^ij*^.'.'. 

about the antidiagonal is given by X^^^zt:''^ = ^Z^^t't+C.^-Si"'^- Clearly, 7-i. + l < 
7 — ^,^ + 1 <^=^ ji, < iy. Of course, £ L.^ is symmetric with respect to the 

antidiagonal of X if and only if x^f^^-f-^'^ = xji.i2...,wi,7 ^/ ^ r denote those 

elements of L^ that are symmetric with respect to the antidiagonal. 

Finally, because X is symmetric with respect to the antidiagonal, taking the transpose 
^11,12, ...,«s-i,7 ^ square submatrix x]^f'^''"f"~^''^ about the antidiagonal of X amounts to 

taking the transpose of about its own antidiagonal. Since the determinant of a 

matrix is the same as the determinant of its transpose about its antidiagonal, we have that 
detX!Y^'-f-^'^= detX!^f'-f-^'^ 

By definition, the determinants of the elements of L^ are polynomials linear in x^. We 
wish to determine how large a finite field must be in order to guarantee that nonzero field 
elements may be substituted for such a way that none of these determinants 
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is zero. We do this by computing an upper bound A'^ on the number of distinct polynomials 
giving the determinants of the elements of L^. As long as the order of the field is bigger 
than A^^, it is clearly possible to make all of these determinants nonzero. From the above 
observations, we see that, once we have computed \L^\ and \L'^\, we may take as an upper 
bound ■= \{\L^\ + |L;|). 

As seen above, the s x s submatrices in L^, s G {2, ... ,7 — 1}, are described by sets 
{ii,i2, • • • ,is-i,ji,j2, ■ ■ ■ , js-i} of indices, where j„ < iy for all G {1, 2, . . . , s - 1}, 1 < Zi < 
. . . < is-i < 7, and 1 < ji < . . . < j^-i < 7 (and the 1x1 submatrix is associated with 
an empty set of indices). Each nonempty index set defines a generalized Young tableau with 
columns having height 2 and integer entries in {2,. ..,7 — 1}; conversely the entries of such 
a tableau constitute a set of indices corresponding to a submatrix in L^. The empty set of 
indices corresponds to a tableau with all columns having height 0. In j2], it is shown that 
the number of such tableaux is given by ni<j<j<7-2 ^^TJ^' Lemma this product is 
C^-i. Consequently, \L^\ = C^-i. 

We compute \L'^\ as follows. If the submatrix -^ij'i^.'.'.'j^"^''^ ^ L'^, where s G {2, ... ,7 — 1}, 
then, since = -^i|ji^...j^Zj'''^5 it must be that 'j — iu = js-u — 1 for all u G 

{l,2,...,s — 1}. Thus, this submatrix is completely determined by the s — 1 integers Wi = 
ji - 1,^2 = j2 - 1, • • ■,Ws-i = js~i - 1- Since Xl^jl^'-'-'l"'^^ ^ ^7' ^^^^ ji = wi + < 
7 — Ws-i = ii, I E {I, . . . , }• These inequalities can be rewritten as wi +Ws~i-i < 7 — 1, 
/ G {1, . . . , f^]}. In other words, {0,wi, . . . ,Ws-i,'y — 1} is a sequence that belongs to 
Ss-i,j-i- Thus, to each s x s submatrix in L'^, where s G {2, . . . , 7 — 1}, we have associated 
a unique sequence in Ss-i^-y-i- Similarly, to each sequence in Ss-i,-^/-i, we can associate a 
unique s x s submatrix in L^^. If s = 1, there is only the submatrix to consider, and we 
have already associated this submatrix with the empty sequence, which in turn corresponds 
with the sequence {0,7 — 1}, the sole member of S'0,7-1. Thus, \L'^\ = J2y=o \Sy,-y-i\- From 
Lemma Em we know that J2y=o \Sy,'y-i\ = Y11=o l^y,7-i|- It sufficient, then, to compute 

Z^y=0 Kj/,7-l|- 

Suppose {ti}f=o ^ ^s-i^^-i. To this sequence, we can associate a nonnegative planar 
walk of length 7 — 1 with s + 1 vertices. The walk begins at the origin, and steps are given 
by the vectors (1,1) and (1,-1). The vertices are the origin, the endpoint of the walk, 
and the points where the direction of the walk changes from up to down or from down to 
up. We make the association in the following way: let ti be the x-coordinate of the ith 
vertex. It is clear that this walk is nonnegative, as the condition defining membership in 
Ts_i guarantees nonnegativity of the associated walk. Conversely, the x-coordinates of 
the s + 1 vertices in a nonnegative planar walk of length 7 — 1 form a sequence in Ts_i^^_i. 
Therefore, this association gives a bijective correspondence between sequences in UyZ^Ty^^^i 
and nonnegative planar walks of length 7 — 1. It is a fact (see, for example, [S]) that the 

number of nonnegative planar walks of length 7 — 1 is given by d^^li)- This means that 

L 2 J 

Ey=o l^s/,7-il = (ilZii)- Consequently, \L'^\ = iViX 
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It remains to show that, in fact, a field of order bigger than elements is large enough 
so that a superregular 7x7 matrix can exist over it. This may be easily seen in the following 
way: the determinant of each submatrix in Si, I & {1, . . . , 7}, is linear in xi. Ni increases with 
I. Thus, if we work over a field of order bigger than elements, it is possible to successively 
substitute nonzero field elements for xi,X2, ■ ■ ■ in such a way that the determinant of each 
submatrix in 5*^, / G {1, . . . , 7}, is nonzero. This completes the proof. □ 

Using computer searches, we were able to determine the exact minimum field size for 
small values of 7. These may be seen in Table 1. We observe that the upper bound + 1 
appears to grow much more quickly than necessary: 



7 


Minimum Field Size Required 


Upper Bound (A''^ +1) 


3 


3 


3 


4 


5 


5 


5 


7 


11 


6 


11 


27 


7 


17 


77 


8 


31 


233 


9 


59 


751 


10 


< 127 


2495 



Table 1: Comparison of Actual Required Field Size and A^^ + 1 



It is still an open problem as to how this bound may be refined. Computer searches lead 
us to make the following conjecture; if true, it would offer a significant improvement to the 
bound given here: 

Conjecture 5.4 For 7 > 5, there exists 37x7 superregular matrix over the held F27-2. 

6 For Future Research: Finding a Construction 

At this point, little is understood about how to construct superregular matrices. The problem 
of finding constructions appears to be a very hard one. One must find a way of guaranteeing 
that all proper submatrices with any number of zeroes above the diagonal have a nonzero de- 
terminant and do so with additional constraints coming from the Toeplitz structure. In [20j, 
a method is given for constructing, for any prime number p, a triangular Toeplitz array of 
dimension p having the property that all full square submatrices have a nonzero determi- 
nant. Unfortunately, there is no way of extending this construction to the much more general 
situation we consider here. 

In jH], the following result is proven: 
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Theorem 6.1 Let X be the 7x7 matrix given by 



X :-- 



1 
1 




Then, 



^7-1 



1 

CD 




1 
1 



••• 




1 

CT") 



1 





1 
1 1 



••• (V) 





1 



where (2)- 



7! 



tj!{7— a;)! 



, is totally positive over the real numbers. Thus, for a sufhciently large 
prime number p, taking the entries of this matrix modulo p gives a superregular matrix. 

This result gives a construction insofar as one knows that, modulo a large enough prime 
number, the matrix X"^^^ is superregular. It is not clear, however, how one may give a good 
bound as to how large p must be for a given 7. 
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